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SUMMARY 

Thymic epithelial cells (TECs) play a critical role in T cell maturation and tolerance induction. The generation of TECs from in vitro 
differentiation of human pluripotent stem cells (PSCs) provides a platform on which to study the mechanisms of this interaction and 
has implications for immune reconstitution. To facilitate analysis of PSC-derived TECs, we generated hESC reporter lines in which 
sequences encoding GFP were targeted to FOXN1, a gene required for TEC development. Using this FOXNl GFP/w line as a readout, we 
developed a reproducible protocol for generating FOXNl-GFP + thymic endoderm cells. Transcriptional profiling and flow cytometry 
identified integrin-(34 (ITGB4, CD 104) and HLA-DR as markers that could be used in combination with EpCAM to selectively purify 
FOXNl + TEC progenitors from differentiating cultures of unmanipulated PSCs. Human FOXNl + TEC progenitors generated from 
PSCs facilitate the study of thymus biology and are a valuable resource for future applications in regenerative medicine. 



INTRODUCTION 

T cells undergo most of their development in the thymus, 
the primary lymphoid organ that regulates their differ- 
entiation and maturation from blood-borne bone- 
marrow-derived precursors and appropriate selection for 
the induction of self-tolerance (Anderson et al., 2007). 
Thymic function is critically dependent on thymic epithe- 
lial cells (TECs), the most abundant cellular constituent of 
the stromal microenvironment. TECs are classified as two 
morphologically and functionally distinct subsets based 
on their localization to the thymic cortex (cTECs) or 
medulla (mTECs). TEC development and identity require 
the forkhead-box transcription factor Foxnl, which, in 
the mouse, demarcates the prospective thymic primor- 
dium within the third pharyngeal pouch at embryonic 
day 11.5 (Ell. 5). Loss of Foxnl results in a "nude" pheno- 
type in mice and rats (Nehls et al., 1994, 1996) and in 
humans (Pignata et al., 1996), characterized by congenital 
hairlessness and defective TEC differentiation, the latter of 
which results in the absence of functional T cells and severe 
immunodeficiency 

T cell insufficiency is associated with other congenital 
thymic hypoplasias, such as DiGeorge syndrome (Jerome 



and Papaioannou, 2001), age-related thymus atrophy, or 
cytoablative therapy-induced thymic involution. In the 
last case, patients undergoing high-dose chemotherapy 
often experience chronic immunosuppression, predispos- 
ing them to a host of opportunistic infections. In all of 
these instances, replenishment of the thymic epithelial 
compartment might provide an avenue to augment 
thymus function and boost T cell output. 

The derivation of tissues by in vitro differentiation of 
pluripotent stem cells (PSCs) has been advanced as a 
platform for use in the emerging fields of cell therapy and 
regenerative medicine. PSCs possess the capacity to give 
rise to the three embryonic germ layers, including the 
definitive endoderm, the precursor of thymic epithelium. 
PSC differentiation protocols that promote definitive 
endoderm formation (D' Amour et al., 2005; Kubo et al., 
2004; Tada et al., 2005; Yasunaga et al., 2005) and the 
subsequent generation of posterior foregut derivatives, 
including pancreatic cells (D 'Amour et al., 2006; Kroon 
et al., 2008) and hepatocytes (Cai et al., 2007; Gouon-Evans 
et al., 2006), are well established. Similarly, strategies 
designed to generate anterior foregut derivatives, such as 
the lung, thyroid, and thymus, have also been reported 
(Green et al., 2011; Lai and Jin, 2009; Longmire et al., 
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Figure 1. Targeted F0XN1 GFP/W hESCs Facilitate Monitoring of Thymic Differentiation toward F0XN1-GFP + Cells 

(A) Targeting strategy used to insert sequences encoding GFP into exon 2 (e2) of the F0XN1 gene Locus via homologous recombination. The 
selection cassette included a neomycin resistance gene (neo R ) flanked by loxP sites (black triangles). Swal restriction endonuclease was 
used to linearize the vector prior to electrop oration into hESCs. The locations of PCR primer pairs (a, b, and c) and a GFP probe (blue line) 
used to characterize targeted clones are also indicated. 

(B) PCR analyses of HES3-derived F0XNl GFP/w hESCs with primer pairs (a and b) spanning the junctions between the targeting vector and 
genomic DNA generated bands of 9.6 kb (5' end) and 3.9 kb (3' end) from targeted F0XNl GFPneoR/w , but not from wild-type hESCs. PCR 
analysis with primer pair c flanking the neo R gene generated bands of 3.7 kb compared to 5.4 kb, indicating excision of the neo R cassette. 
Southern blot analysis of Hindlll-digested F0XNl GFP/w genomic DNA using the GFP probe detected a single 4.2 kb fragment, demonstrating 
integration of a single copy of the F0XN1 -targeting vector. 

(C) Heatmap summarizing the results of flow cytometry experiments assessing the percentage of GFP + cells from days 30-35 embryoid 
bodies generated with the indicated BMP4 and Activin A concentrations. Data were derived from three independent experiments. FACS 
profiles representing selected conditions are shown on the right. 

(legend continued on next page) 
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2012). Recently, two groups reported methods for the 
generation of thymic endoderm from human pluripotent 
stem cells (Parent et al., 2013; Sun et al., 2013). Impor- 
tantly, these studies showed that differentiated mixed cul- 
tures containing thymic progenitors could mature in vivo 
to form grafts capable of supporting T cell development 
in nude {Foxnl~ f ~) or other immunocompromised mice. 
However, because of the lack of appropriate surface 
markers, it was not possible to dissect the contributions 
or requirements of the various cell types present in the 
differentiation cultures at the time of transplantation. 
This deficit could be partly remedied by the availability of 
FOXN1 reporter lines or surface markers that allow further 
fractionation of cultures containing FOXNl + cells. 

We generated FOXNl GFP/w human embryonic stem cell 
(hESC) reporter lines that were used to develop a robust 
serum-free protocol for the generation of FOXNl + thymic 
endodermal progenitors. We found that high levels of 
Activin A and KGF efficiently induced the differentiation 
of FOXNl + cells and that these cells expressed genes 
involved in endoderm and thymus development. Tran- 
scriptional profiling of purified FOXNl-GFP + cells allowed 
the identification of several combinations of cell-surface 
markers that could selectively isolate FOXNl + TEC pro- 
genitor populations derived from unmodified PSC lines. 
Collectively, these reagents and findings represent a valu- 
able resource for the further investigation of thymic devel- 
opment from pluripotent stem cells. 

RESULTS 

FOXNl GFP/w hESCs Facilitate Analysis of Thymic 
Differentiation In Vitro 

To facilitate analysis of thymic differentiation of PSCs, we 
used homologous recombination to target a GFP reporter 
gene to the endogenous FOXN1 locus in MEL1 (Millipore) 
and HES3 (Richards et al, 2002) hESCs (Figures 1A and IB; 
Figure S1A available online). Targeted FOXNl GFP/w hESCs 
retained expression of pluripotent stem cell markers, 
formed multilineage teratomas when transplanted into 
immunodeficient NOD/SCID mice, and maintained 
normal karyotypes (Figures S1B-S1D). In order to identify 
conditions that favored the formation of FOXNl + 
endodermal cells, we differentiated FOXNl GFP/w hESCs in 
serum-free media as spin embryoid bodies (EBs) (Ng et al., 
2008) and cross-titrated bone morphogenetic protein-4 
(BMP4) and Activin A, growth factors that activate 



signaling pathways required for the generation and 
patterning of mesendoderm and have been previously 
shown to promote PSC differentiation in vitro (D'Amour 
et al, 2005; Davis et al., 2008; Kubo et al, 2004; Tada 
et al., 2005; Yasunaga et al., 2005). This heatmap analysis 
employed an iterative process (Elliott et al., 2011) and 
identified combinations of BMP4 and Activin A that 
promoted the appearance of FOXNl-GFP + cells, which 
also costained with the endodermal marker EpCAM (Fig- 
ure 1C; Figures S2A-S2C). Based on these results and find- 
ings from experiments using different batches of Activin 
A (data not shown), for all subsequent experiments, we 
differentiated hESCs for 5 days in serum-free medium 
supplemented with 150 ng ml -1 Activin A alone (Fig- 
ure ID). These high Activin conditions were similar to 
those used in previously published differentiation pro- 
tocols where definitive endoderm derivatives were the 
desired outcome (D'Amour et al., 2006; Green et al., 
2011). Flow cytometric analysis of differentiating 
MIXLl GFP/w hESCs, which can be used to monitor mesen- 
doderm induction (Davis et al., 2008), indicated that at 
differentiation day 4, over 99% of cells were MIXL1-GFP + , 
arguing against the possibility that FOXNl-GFP + cells rep- 
resented ectodermally derived keratinocytes (Figure S3 A). 
Time course quantitative real-time PCR analysis of differen- 
tiation cultures demonstrated that from day 12, cells pro- 
gressively upregulated expression of the pharyngeal pouch 
markers HOXA3 and PAX9, as well as the thymic markers 
INVOLUCRIN, FGFR2 (the receptor for KGF), KERATIN-10, 
and FOXN1 itself. Expression of the general endodermal 
markers FOXA2 and SOX 17 declined from a peak at day 
12, consistent with expansion of a specialized anterior 
endoderm population that does not express these markers 
at high levels (Figure S3B). 

Keratinocyte growth factor (KGF or FGF7), a factor previ- 
ously implicated in thymic development, was required to 
expand the FOXNl-GFP + population once it emerged in 
culture, with the timing of the KGF supplementation 
affecting the percentage of GFP + cells obtained (Figures 
ID and IE). KGF is expressed in the thymus by the sur- 
rounding mesenchyme and by different thymocyte subsets 
(Rossi et al., 2007) and exerts a paracrine effect by binding 
to the FgfR2IIIb receptor, which, within the thymus, is 
expressed exclusively by TECs (Erickson et al., 2002). More- 
over, exogenous administration of KGF has been shown to 
increase thymic cellularity by enhancing the proliferation 
and function of TECs (Erickson et al., 2002; Min et al., 
2007; Rossi et al, 2007). After the addition of KGF, GFP + 



(D) Spin embryoid body protocol for generating F0XN1-GFP + cells. 

(E) Summary of flow cytometric data indicating the effects of KGF addition. Following EB culture with Activin A, KGF was either added at 
days 7, 14, and 21, days 14 and 21, or day 21. Differentiations were performed in APEL or BPEL medium with or without 50 ng ml -1 noggin 
at days 5 and 6. Error bars show SEM for three independent experiments. 
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cells could be visualized using fluorescence microscopy 
from approximately day 25 of differentiation. Flow cyto- 
metric analysis of cells differentiated as outlined in Fig- 
ure ID indicated that the first FOXNl-GFP + cells emerged 
from a PDGFRoc - and EpCAM + population as early as differ- 
entiation day 15 and increased in frequency thereafter 
(up to 38% GFP + at day 35) (Figure 2A). MEL 1 -derived 
FOXNl GFP/w hESCs differentiated with similar kinetics to 
the HES3-derived line, suggesting our protocol is applicable 
to other PSC lines (Figure 2A). Quantitative real-time PCR 
analysis of cells purified on the basis of GFP and EpCAM 
expression showed that FOXN1 transcripts were confined 
to the GFP + population (Figure 2B) ; confirming the fidelity 
of the GFP reporter gene. This conclusion was supported by 
intracellular flow cytometric analysis of FOXN1 protein 
expression in purified GFP + and GFP" cells (Figure S3C). 
Prolonged culture of aggregates of fluorescence-activated 
cell sorting (FACS) -purified GFP + EpCAM + cells indicated 
that GFP expression was retained for up to 3 weeks 
(Figure 2B). 

Gene Expression Analysis of hESC-Derived FOXNl + 
TEC Progenitors 

We next performed microarray analysis to survey genes 
expressed in the GFP + fraction and to search for markers 
that might identify FOXNl + cells in differentiating cultures 
derived from unmanipulated human PSCs. We first 
compared the global gene expression within the GFP + 
and GFP - EpCAM + epithelial subpopulations. This analysis 
identified 115 genes that were upregulated in the GFP + 
EpCAM + fraction compared to the GFP - EpCAM + fraction 
(Figure 3 A; see Tables SI and S2 for the full list of genes 
differentially upregulated by 3-fold), and many of these 
have been previously associated with thymus development 
and/or FOXN1 expression. These included the canonical 
HWT-signaling molecules WNT4 and WNT3 and the 
WNT regulator KRM2, the NOTCH ligands DILI, thought 
to be required for crosstalk in driving T-lymphopoiesis, 
and JAG2 (Candi et al., 2007), a downstream effector of 
the ANP63 isoform of the TP63 transcription factor, 
required for the proliferation of TECs and other stratified 
epithelia, human TEC type I cytokeratins KRT14, KRT15, 
and KRT16, and the chemokine CXCL14 whose transcript 
is expressed in fetal TECs and may aid in the recruitment 
of T cell precursors (Liu et al., 2005). Conversely, 213 
transcripts were preferentially upregulated in the GFP~ 
EpCAM + population (Figure 3A). These included HAND1 
and ISL1, genes expressed during cardiac morphogenesis 
and in neural crest derivatives, although ISL1 is also ex- 
pressed in pancreatic endoderm, other definitive endo- 
dermal markers, such as APOA1, SOX9, and TGFB1, and 
human TEC type II cytokeratins KRT6, KRT7, and KRT8. 
Interestingly, the chemokine CXCL12 was also upregulated 



within the GFP~EpCAM + population. CXCL12 is expressed 
within TECs and bone marrow niches, where it acts as a 
chemoattractant for T cell precursors, monocytes, and 
hematopoietic stem cells (HSCs) (Greenbaum et al., 2013; 
Liu et al, 2005). Overall, the GFP"EpCAM + population 
expressed genes associated with gene ontology (GO) cate- 
gories including "blood circulation" (p value < 0.002), "cir- 
culatory system process" (p value < 0.002), "cell migration" 
(p value < 0.003), and "cell motility" (p value < 0.006) (Fig- 
ure S4A). In contrast, GFP + EpCAM + cells were enriched for 
genes associated with FOXN1 expression, thymus develop- 
ment, or epithelial-related GO categories (Figure S4A). 
Although it is not possible to infer from gene expression 
data which genes are direct targets of FOXN1, the ability 
to generate and purify a large number of FOXNl + cells pro- 
vides an opportunity to examine this issue using tech- 
niques such as chromatin immunoprecipitation (ChIP). 
Last, we also compared gene expression of the GFP + 
EpCAM + and GFP EpCAM" populations (Figure S4B). 
Included among the transcripts preferentially expressed 
within GFP + EpCAM + cells were genes with a role in thymus 
development (KRM2, J AG 2, WNT4, BCL11B, EFNA1, and 
EYA2), epithelial morphogenesis (KRTS, TP63, EpCAM, 
KRT14, EC AD, ITGB4, and EGFR), and endoderm specifica- 
tion (ALB and AFP). 

Keratin-5 and Keratin-8 have been detected within the 
human thymus as markers of the medulla and cortex, 
respectively (Laster et al., 1986; Shezen et al., 1995). Our 
microarray analysis demonstrated high coexpression of 
Keratin-5 and its heterodimerization partner Keratin-14 
within FOXNl + (GFP + EpCAM + ) cells (Figure 3B). The 
former result was confirmed by quantitative real-time 
PCR (Figure S4C). Interestingly, although also detected in 
GFP + EpCAM + cells, Keratin-8 is preferentially expressed 
by the GFP~EpCAM + subpopulation (Figure 3A; Fig- 
ure S4C). These results might suggest that the FOXNl + cells 
generated in our protocol more closely reflect a medullary 
phenotype. Finally, FOXNl + cells were devoid of genes 
that mark the parathyroid gland, including GCM2, PTH, 
and CaSR (Figures S4D and S4E), consistent with the obser- 
vation that these two developmentally related tissues can 
be delineated on the basis of FOXN1 and GCM2 expression 
(Gordon et al., 2001). Similarly, other genes associated with 
the demarcation of thymic versus parathyroid primordia 
(BMP4, BMPR1A, BMPR2, and NOGGIN) were not restricted 
to specific subpopulations (Figure S4E). 

Examination of microarray data indicated that a number 
of transcription factors, signaling molecules, and structural 
proteins implicated in TEC differentiation were preferen- 
tially expressed in GFP + EpCAM + cells (Figure 3B). We 
were particularly interested in identifying transmembrane 
proteins for which well-characterized cell-surface anti- 
bodies were readily available. Such antibodies could then 
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Figure 2. Activin A and KGF Induce GFP Expression in Differentiating F0XN1 GFP/W hESCs 

(A) Flow cytometric analysis of differentiating HES3 -derived F0XNl GFP/w hESCs shows F0XN1-GFP + cells emerge at day 15 and subsequently 
increase thereafter. F0XN1-GFP + cells coexpress the thymic epithelial marker EpCAM, but not the mesodermal marker PDGFRa. Flow 
cytometric analysis of a second independent MELl-derived F0XNl GFP/w hESC line demonstrated that the time course of GFP and EpCAM 
induction was comparable between HES3- and MELl-derived lines. 

(B) Quantitative real-time PCR analysis showed that F0XN1 expression was confined to the GFP + EPCAM + fraction, with expression absent 
from the GFP~EpCAM + and GFP~EpCAM" subpopulations. Aggregated GFP + EpCAM + cells retained GFP expression for up to 3 weeks. 
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Figure 3. Microarray Analysis of F0XN1-GFP + Thymic Epithelial Progenitor Cells 

(A) Comparison of gene expression in day 30 GFP + EpCAM + and GFP~EpCAM + cells. Selected differentially regulated key genes are listed 
along with the fold change in gene expression between the two populations. 

(B) Genes associated with endoderm commitment and thymus development are preferentially expressed in GFP + EpCAM + cells compared to 
GFP~EpCAM + and GFP~EpCAM~ cells. The cell-surface marker ITGB4 was among the transcripts upregulated. 



be employed to isolate FOXNl + cells from differentiating 
cultures of genetically unmodified PSC lines. One such 
gene upregulated in GFP + EpCAM + cells was integrin-^4 
(ITGB4 or CD104) (Figure 3B). The ITGB4 subunit 
commonly associates with the integrin-oc6 (ITGA6) subunit 
to form an integrin a6p4 heterodimer that functions as a 
receptor for laminins within the extracellular matrix (Gian- 
cotti, 1996). ITGB4 is expressed in normal human TECs, 
where it is thought to activate intracellular signaling path- 
ways that regulate production of the thymic cytokine IL-6, 
which in turn affects TEC differentiation and thymocyte 
homing (Mainiero et al., 2003). Moreover, ITGB4 is also 
expressed by cultured human TEC lines (Beaudette-Zlata- 
nova et al., 201 1). Time course analysis showed low expres- 
sion of ITGB4 in early differentiating embryoid bodies, 
which preceded the appearance of FOXNl-GFP + cells, 
and was associated with a population that concomitantly 
upregulated EpCAM expression (Figure 4A). Indeed as 
differentiation proceeded, by day 15 the majority of 
EpCAM + cells coexpressed ITGB4, consistent with the 
expression of these markers on a broad range of epithelial 
tissues (Figure 4A). EpCAM + , FOXNl-GFP + cells appearing 
from differentiation day 20 onward expressed high levels 
of ITGB4 (ITGB4 hi ). Furthermore, as the number of 
FOXNl-GFP + cells increased in response to KGF treatment, 
GFP expression became more clearly associated with the 
ITGB4 hl EpCAM + subpopulation. Quantitative real-time 
PCR analysis of FACS-purified day 34 cells derived from 
the differentiation of FOXNl GFP/w hESCs confirmed 
that FOXN1 gene expression was largely confined to the 
ITGB4 hl EpCAM + population and that expression levels 



were comparable to those seen in cells sorted on the 
basis of GFP expression alone (Figure 4B). Negligible 
FOXN1 expression was observed in sorted populations 
that expressed intermediate or low levels of ITGB4 or 
were EpCAM - . 

To explore the utility of these cell-surface makers for 
monitoring developing TECs, we examined other human 
pluripotent stem cell lines, including MEL 1 -derived 
FOXNl GFP/w hESCs, genetically unmodified H9 hESCs 
(Thomson et al., 1998) and DF19-9-7T human iPSCs (Yu 
et al., 2009) (Figure 4B). All human PSC lines generated 
ITGB4 hi EpCAM + cells with parallel kinetics, although the 
proportion of this population varied between different 
cell lines. Cell populations that expressed ITGB4 and 
EpCAM but were not ITGB4 hi (designated ITGB4 + EpCAM + ) 
varied in composition between different PSC lines. Never- 
theless, we found FOXN1 expression to be solely confined 
to those EpCAM + cells that were ITGB4 hi (Figures 4B and 
4C). Overall, flow cytometric analysis of HES3 and MEL1 
FOXNl GFP/w cultures indicated that FOXNl-GFP + cells 
comprised over 80% of the ITGB4 hi EpCAM + fraction (Fig- 
ure S4F). Conversely, 95% of the GFP + population was 
found to be ITGB4 hl EpCAM + , testifying to the close associ- 
ation between these markers and FOXN1 expression 
(Figure S4G). 

In addition to those markers identified by transcriptional 
profiling, we examined the expression of the TEC-associ- 
ated receptor, HLA-DR: a human major histocompatibility 
complex class II (MHCII) cell-surface receptor encoded by 
the human leukocyte antigen (HLA) complex (Walsh 
et al., 2003). Within the thymus, the primary function of 
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Figure 4. Integrin-|34 and EpCAM Identify F0XN1 + Thymic Epithelial Progenitor Cells 

(A) Time course analysis of differentiating HES3-derived F0XNl GFP/w cells demonstrated that F0XN1-GFP + cells could be identified on the 
basis of integrin-p4 (ITGB4) and EpCAM expression. GFP expression was localized to EpCAM + cells that expressed high levels of ITGB4. 
EpCAM + cells are shown gated in light blue, whereas GFP + cells are shown gated in green. Numbers indicate the percentage of cells in the 
corresponding quadrants. 

(B) Quantitative real-time PCR analysis confirmed that F0XN1 transcripts were restricted to the ITGB4 hl EpCAM + population differentiated 
from HES3-derived F0XNl GFP/w cells and were present at levels similar to those observed in cells isolated on the basis of F0XN1-GFP 
expression alone. F0XN1 expression in differentiating MELl-derived F0XNl GFP/w cells and H9 hESCs or DF19-9-7T human iPSCs was also 
confined to the ITGB4 hl EpCAM + population. Colored boxes represent the gating strategy used to isolate subpopulations for flow cytometric 
sorting. I, ITGB4; E, EpCAM. — , +, and hi indicate negative, positive, and high levels of expression, respectively. 

(C) Histograms demonstrating the summary of raw (left panel) and normalized (right panel) data from individual quantitative real-time PCR 
experiments shown in Figure 4B (n = 4). Data are shown as the mean ± SEM. p values were calculated using a two-tailed Student's t test. 

HLA-DR, similar to other MHCII genes, is to present peptide unique in that they constitutively express MHCII without 
antigens during positive and negative T cell selection in the the need for interferon-y (IFN-y) stimulation (Drukker 
thymic cortex and medulla (Yang et al., 2006). TECs are et al., 2002; Yang et al., 2006), consistent with the fact 
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Figure 5. HLA-DR and EpCAM Identify PSC-Derived F0XN1 + Thymic Epithelial Progenitor Cells 

(A) Time course analysis of differentiating HES3-derived F0XNl GFP/w cells demonstrated that F0XN1-GFP + cells could be identified on the 
basis of coexpression of HLA-DR and EpCAM. EpCAM + cells are shown gated in light blue, whereas GFP + cells are shown gated in green. 
Numbers indicate the percentage of cells in the corresponding quadrants. 

(B) Quantitative real-time PCR analysis confirmed that F0XN1 transcripts were substantially enriched in the HLA-DR + EpCAM + population 
differentiated from HES3-derived F0XNl GFP/w cells and were present at levels similar to those observed in F0XN1-GFP + cells. F0XN1 
expression in differentiating MELl-derived F0XNl GFP/w cells and H9 hESCs was also enriched in the HLA-DR + EpCAM + population. Colored 
boxes represent the gating strategy used to isolate subpopulations for flow cytometric sorting. H, HLA-DR; E, EpCAM. 

(C) Histograms demonstrating the summary of raw (left panel) and normalized (right panel) data from individual quantitative real-time 
PCR experiments shown in Figure 5B (n = 3). Data are shown as the mean ± SEM. p values were calculated using a two-tailed Student's 
t test. 



that IFN-y-deficient mice undergo normal thymic develop- 
ment (Dalton et aL, 1993). To explore whether HLA-DR 
could also identify FOXNl-GFP + cells, we performed flow 
cytometry to monitor the expression of HLA-DR and 
EpCAM over a 40 day differentiation time course (Fig- 
ure 5A). Similar to previous results (Drukker et aL, 2002), 
HLA-DR was not expressed on the surface of undifferenti- 
ated hESCs (data not shown) or during the early stages of 
embryoid body formation. Expression was first observed 
from approximately day 15 and increased thereafter. By 
day 40, we detected a distinct population of HLA-DR + 
EpCAM + cells that contained the majority of FOXN1- 



GFP + cells (Figure 5A). Furthermore, all HLA-DR + cells 
were also EpCAM + , suggesting that HLA-DR is restricted 
to only thymic epithelial cells in our culture system. 
Consistent with past studies, HLA-DR expression on 
FOXNl-GFP + TEC progenitors was not upregulated in 
response to IFN-y stimulation. Conversely, as expected, 
IFN-y treatment induced expression of HLA-DR in the 
non-GFP population (Figure S4H). Finally, quantitative 
real-time PCR analysis was performed on day 30 differenti- 
ated cells purified on the basis of HLA-DR and EpCAM 
expression (Figure 5B). This analysis showed that FOXN1 
transcripts were greatly enriched in the HLA-DR + EpCAM + 
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fraction, with expression levels comparable to those seen in 
cells sorted on the basis of GFP alone. The veracity of 
this result was confirmed in similar analyses of HLA-DR + 
EpCAM + cells isolated from differentiating cultures of 
MEL1 FOXNl GFP/w and H9 hESC lines (Figures 5B and 
5C). Similar to results obtained for cells isolated on the 
basis of ITGB4 and EpCAM expression, flow cytometric 
analysis indicated FOXNl-GFP + cells comprised over 70% 
of cells within the HLA-DR + EpCAM + fraction (Figure S4I). 
The clear association of FOXN1 expression with HLA-DR 
further substantiates the thymic identity of cells generated 
using our protocol. 

The upregulation of HLA-DR in conjunction with gene 
expression analyses indicating the presence of NOTCH 
signaling receptors and ligands prompted us to test 
whether FOXNl-GFP + cells generated with our differ- 
entiation protocol might be competent to promote the 
development of T lineage cells from human CD34 + 
hematopoietic progenitors. To test this hypothesis, we 
cocultivated purified FOXNl-GFP + with human umbilical 
cord blood-derived CD34 + CD7 + T lineage progenitors, a 
cell type previously shown to differentiate into CD4 + or 
CD8 + T cells (Awong et al., 2009; La Motte-Mohs et al, 
2005). Over a 4 week coculture period, we analyzed the 
hematopoietic component for expression of CD la, CD5, 
CD4, CD8, and CD3, markers of immature T lineage 
progenitors, differentiating proT-cells and mature T cells. 
Although we observed the induction of CDla, CD5, CD4, 
and CD8 expression, the differentiation step was not 
specifically dependent on the presence of FOXNl-GFP + 
cells (Figures S5A-S5C and S5E). Moreover, we did not 
observe convincing expression of the T-lineage-specific 
marker CD3 (Figure S5D). Rather, a significant proportion 
of CD34 + cells differentiated into cells with a myeloid 
phenotype (CD45 + CD14 + ) (Figure S5E). Hence, although 
our human PSC-derived cells possess a TEC-like phenotype, 
we believe that they represent a stage that is too immature 
to drive T cell differentiation. Further investigations into 
the growth factors necessary for the commitment of 
FOXNl + TEC progenitors into mature TECs in our culture 
system are therefore required. 



DISCUSSION 

We devised a simple method for generating and isolating 
FOXNl + thymic epithelial progenitor cells differentiated 
from human pluripotent stem cells. We inserted a GFP 
reporter gene into the locus encoding the key thymic tran- 
scription factor FOXN1, to facilitate monitoring of the early 
steps involved in human thymus specification. This 
reporter gene faithfully identified cells expressing FOXN1 
and enabled isolation of FOXNl + thymic progenitors 



from heterogeneous differentiation cultures. Although 
targeted lines represent an extremely useful research tool, 
genetically manipulated lines are undesirable for potential 
future clinical use. Moreover, the technical requirements 
for genetic modification make the application of this 
approach impractical for the ever-expanding number of 
available pluripotent stem cell lines. With these points in 
mind, we interrogated the expression profile of FACS-puri- 
fied FOXNl-GFP + cells through microarray analysis with 
the aim of identifying genes that could serve as a surrogate 
marker of FOXN1 expression. From this screen, we identi- 
fied ITGB4 as a cell-surface marker that, in conjunction 
with EpCAM, could be used to isolate FOXNl + cells from 
genetically unmodified hESCs and from human iPSCs. 
Similarly, our analysis showed that HLA-DR and EpCAM 
were also specific markers of FOXNl + TEC progenitors. 
Taken together, our study provides a model system to 
investigate the signaling pathways required for TEC 
commitment and differentiation. 

Gene expression analysis revealed that cells differenti- 
ated for up to 30 days expressed markers implicated in 
endoderm specification and TEC development but lacked 
the expression of important late-stage TEC markers asso- 
ciated with functional maturation. In particular, relative 
to human pediatric thymic stroma (data not shown), 
in vitro-derived FOXNl + TEC progenitors showed low 
expression of autoimmune regulator (AIRE), a gene critical 
for intrathymic expression of tissue-restricted antigens, 
which in turn is required to induce tolerance to peripheral 
antigens (Anderson et al., 2007). In addition, CD80, a 
costimulatory molecule expressed on functionally mature 
mTECs (Derbinski et al., 2005), was also not enriched on 
FOXNl + cells. These data indicated that the thymic endo- 
derm produced using our culture conditions most likely 
represented an early stage of ontogeny in which overt 
thymic functional characteristics were yet to be acquired. 

The latter stages of thymic epithelial development are 
dependent on the presence of hematopoietic cells that, in 
the mouse, infiltrate the thymic primordium at embryonic 
day 12, soon after the onset of FOXN1 expression. Sub- 
sequent to this, the thymic endoderm and lymphoid 
progenitors engage in signaling crosstalk that ultimately 
yields functionally mature TECs and educated T cells. The 
Notch-Delta system is a central signaling mechanism 
implicated in directing this crosstalk (Mohtashami et al., 
2010). In the thymus, Notchl expressed on hematopoietic 
progenitors interacts with Dill or DU4 on TECs to either 
maintain or induce T-lineage commitment and differentia- 
tion. Because of this, we tested whether coculture of 
FOXNl + thymic endoderm with hematopoietic progenitor 
cells could provide an environment whereby both cell 
types could undergo further maturation. However, under 
the conditions used, we saw no evidence of ongoing 
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Tcell differentiation. Moreover, analysis of these cocultures 
suggested that the viability of thymic endoderm cells 
was compromised, suggesting that further work will be 
required to define conditions that are permissive for 
the continued growth and development of both hemato- 
poietic and endodermal cells types. 

Recently, Parent et al. (2013) and Sun et al. (2013) 
reported methods for the generation of thymic endoderm 
from pluripotent stem cells. Following induction of defini- 
tive endoderm by Activin A treatment, both groups could 
promote the appearance of anterior foregut and subse- 
quently pharyngeal endoderm by carefully choreographed 
manipulation of WNT, retinoic acid, and BMP4 signaling, 
mirroring the inductive and repressive actions of these 
pathways during early thymic ontogeny. Both studies 
showed that mixed differentiation cultures containing 
thymic progenitors could instruct T cell development in 
xenotransplantation models. In our differentiation proto- 
col, the activities of the WNT, retinoic acid, and BMP4 
pathways were not deliberately manipulated with exoge- 
nous growth factors or inhibitors, potentially explaining 
the inability of our FOXNl + cells to upregulate markers of 
late-stage TEC differentiation or to support T cell develop- 
ment. Having said this, we did not formally test whether 
heterogeneous non-FACS-purified populations containing 
FOXNl + cells could differentiate further following trans- 
plantation or indeed support T cell development in vitro. 
Consequently, the potential functionality and differentia- 
tion status of the cells generated using our simple method 
is difficult to directly compare with cells described by 
Parent et al. (2013) and Sun et al. (2013), where mixed 
cultures were used for in vivo studies. 

The cell lines and surface markers reported in this study 
provide a facile platform on which to begin the process 
of identifying key events and critical cell populations 
required to derive fully functional TECs from PSCs. These 
FOXNl GFP/w reporter lines represent a unique research 
tool to analyze human thymus development, whereas the 
cell-surface marker combinations recognized by antibodies 
against ITGB4, HLA-DR, and EpCAM can be used to purify 
TEC progenitors from cultures of differentiating un- 
manipulated PSCs. Although the exogenous growth factors 
required for ongoing differentiation of FOXNl + progenitor 
cells into functionally mature TECs remain to be estab- 
lished, our study contributes critical tools for efforts to 
generate this clinically important cell type. 

EXPERIMENTAL PROCEDURES 

Construction of the FOXN1 -Targeting Vector 

The FOXN1 -targeting vector was assembled using standard cloning 
techniques and Gene Bridges Red/ET Recombination technology 
(see the Supplemental Experimental Procedures). The final 



18.2 kb FOXN1 -targeting vector comprised a 9.6 kb 5' homology 
arm, a GFP-coding sequence, a loxP-flanked neomycin resistance 
gene driven by the mouse phosphoglycerate kinase (PGK) gene 
promoter, a 3.9 kb 3' homology arm, and a pBR322-based plasmid 
backbone. The targeting vector was linearized by Swal digestion 
prior to electroporation. 

Generation, Identification, and Characterization of 
Targeted FOXNl GFP/w hESCs 

The FOXN1 -targeting vector was electroporated into hESCs as 
previously described (Costa et al., 2007) and as detailed in the 
Supplemental Experimental Procedures. The loxP-flanked neo R 
cassette was removed by Cre recombinase-mediated excision as 
previously described (Davis et al., 2008). For one of the two 
independently derived targeted hESC lines (HES3 FOXNl GFP/w 
and MEL1 FOXNl GFP/w ), Southern blot analysis with a labeled 
GFP probe (Figure 1A) was used to confirm the presence of a 
single-integration event (Figure IB). Karyotype analysis was 
performed by the Cytogenetics Department at Southern Cross 
Pathology, Monash Medical Centre. Teratoma formation and 
analysis were performed as previously described (Costa et al., 
2005). Animal experiments were conducted under the approval 
of the Monash University Animal Ethics Committee (number 
SOBSA/MIS/2009/07). 

hESC and iPSC Culture and Differentiation 

hESC lines (HES3, MEL1, and H9 [Thomson et al., 1998]) and the 
induced pluripotent stem cell (iPSC) line DF19-9-7T (Yu et al., 
2009) were maintained as previously described (Costa et al., 
2008). hESCs and iPSCs were differentiated as spin embryoid 
bodies in serum-free media as previously described (Ng et al., 
2008). Briefly, 1 day prior to differentiation, cells were passaged 
onto a new tissue culture flask seeded with low density (1 x 10 4 
cells/cm 2 ) mouse embryonic fibroblasts. At day 0, cells were 
harvested and deposited into each well (3 x 10 3 cells/well) of a 
96-well round-bottom nonadherent plate (Nunc) and briefly 
centrifuged to promote cell aggregation. 

Heatmap experiments were performed using APEL medium (Ng 
et al., 2008), supplemented with 0-320 ng ml" 1 Activin A (R&D 
Systems) and 0-320 ng ml -1 bone morphogenetic protein-4 
(BMP4; R&D Systems) in 96-well round-bottom low-attachment 
plates (Costar) (Elliott et al., 2011). Cytokines were removed on 
day 5; at day 7, embryoid bodies were transferred to 96-well flat- 
bottom adherent tissue culture-treated plates (BD Falcon) contain- 
ing APEL medium lacking PVA (AEL medium) (Ng et al., 2008). 
After 30-35 days of differentiation, embryoid bodies were dissoci- 
ated with TrypLE Select (Invitrogen), stained with the relevant 
antibodies, and analyzed with FACSDIVA software (BD Biosci- 
ences) on either the BD LSRII or BD LSRFortessa cell analyzers 
(BD Biosciences), fitted with a BD high-throughput sampler 
(HTS) module. 

For thymic endoderm differentiation assays, cells were initially 
cultured in APEL or BPEL medium (Ng et al., 2008), supplemented 
with 150 ng ml" 1 human Activin A. On day 5, Activin A-contain- 
ing APEL or BPEL medium was replaced with APEL or BPEL me- 
dium alone. On day 7, EBs were transferred to 96-well flat-bottom 
adherent plates (BD Falcon) in AEL or BEL medium (media lacking 
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PVA) (Ng et al., 2008). AEL or BEL medium supplemented with 
40 ng ml -1 human keratinocyte growth factor (KGF; R&D Systems) 
was used to replenish the cultures on days 14, 21, 28, and 35. 
Embryoid bodies were harvested for analysis by flow cytometry 
at the times indicated. For experiments using MIXLl GFP/w hESCs 
(Davis et al., 2008), cells were differentiated in BPEL medium sup- 
plemented either with 150 ng ml -1 Activin A, 20 ng ml -1 human 
BMP4, and 100 ng ml" 1 Activin A, or with 100 ng ml" 1 FGF2 
(Peprotech). In all instances, hESC and iPSC cultures and differen- 
tiations were maintained at 37°C, in a 5% C0 2 /air environment. 

Flow Cytometric Analysis and Sorting 

For analysis and sorting of live cells, hESCs and embryoid bodies 
were dissociated to a single-cell suspension using TrypLE Select 
(Invitrogen), filtered through a 35 uM cell-strainer cap (BD Falcon), 
and labeled with the appropriate antibodies (see the Supplemental 
Experimental Procedures) as previously described (Davis et al., 
2008). Flow cytometry was performed using a FACSCalibur, 
FACSDiva, or Influx Cell Sorter (all from BD Biosciences). 
Flow cytometric gates were set using unmodified hESCs or 
FOXN1 Gi7p/w -targeted cells labeled with an appropriate isotype 
control antibody. Live cells were identified on the basis of forward 
scatter, side scatter, and propidium iodide (PI) exclusion. 

Intracellular flow cytometry using mouse anti-human OCT4 
(clone C-10; Santa Cruz Biotechnology) and mouse anti-human 
FOXN1 (clone E-3; Santa Cruz Biotechnology) antibodies was per- 
formed essentially as described by Davis et al. (2008) and as 
detailed in the Supplemental Experimental Procedures. 

Culture of Sorted Cell Aggregates 

Cells purified by flow cytometry were aggregated by centrifugation 
for 5 min at 1,500 rpm, using the spin EB protocol (5-10 x 10 3 
cells/well) in BPEL medium supplemented with 5 uM Y27632 
ROCK inhibitor, as previously described (Goulburn et al., 2011). 
Aggregated cell clusters were cultured on gelatin-coated wells of a 
96-well flat-bottom adherent plate in BEL medium supplemented 
with 40 ng ml" 1 KGF. 

Quantitative Real-Time PCR and Microarray Gene 
Expression Analysis 

Total RNA was prepared using a High Pure RNA Isolation Kit 
(Roche), in accordance with the manufacturer's instructions. 
Quantitative real-time PCR was performed essentially as described 
by Pick et al. (2007), with probes detailed in the Supplemental 
Experimental Procedures. For microarray analysis, RNA samples 
were amplified, labeled, and hybridized to the Human WG-6 
(v. 3.0) BeadChip and the Human HT-12 (v. 3.0) BeadChip (Illu- 
mina) at the Australian Genome Research Facility (The Walter 
and Eliza Hall Institute of Medical Research). Data were analyzed 
using GenomeStudio Gene Expression Module (v. 1.5.4) (Illumina) 
using average normalization across all samples, with additional 
analysis performed using GeneSpring GX software (Agilent Tech- 
nologies), as previously described (Goulburn et al., 2011). 

IFN-y Stimulation Assay 

Human IFN-y (R&D Systems) at 10 ng ml" 1 was added to day 30 
thymic differentiation cultures in BEL medium. After 72 hr of 



IFN-y stimulation, differentiated cells were harvested and analyzed 
for HLA-DR expression by flow cytometry. 

CD34 + CD7 + proT-Cell and FOXNl-GFP + TEC 
Progenitor Coculture Assays 

Umbilical cord blood (UCB) mononuclear cells were obtained 
and processed as previously described (La Motte-Mohs et al., 
2005). They were pre-enriched into lineage-negative (Lin") 
fractions, sorted into CD34 + hematopoietic stem cells, and 
cultured on OP9-DL1 cells for 9-10 days, as previously described 
(Awong et al., 2009). CD45 + CD34 + CD7 + proT-cells were FACS 
purified and either cultured alone (1 x 10 3 cells/well) or with 
FOXNl-GFP + (3.5 x 10 3 cells/well) or GFP" (3.5 x 10 3 cells/well) 
cells in 96-well round-bottom low-attachment plates (Costar), 
containing APEL medium supplemented with 20% FBS, rhFLT3L 
(5 ng ml" 1 , Peprotech), rhIL7 (5 ng ml" 1 , Peprotech), and rhSCF 
(30 ng ml" 1 , Peprotech). A half-media change was performed every 
3-4 days. Cultures were analyzed by flow cytometry using the indi- 
cated antibodies and as detailed in the Supplemental Experimental 
Procedures. 
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